Speech-Driven Expressive Talking Lips with Conditional Sequential Generative Adversarial Networks

نویسندگان

چکیده

Articulation, emotion, and personality play strong roles in the orofacial movements. To improve naturalness expressiveness of virtual agents (VAs), it is important that we carefully model complex interplay between these factors. This paper proposes a conditional generative adversarial network, called sequential GAN (CSG), which learns relationship emotion lexical content principled manner. uses set articulatory emotional features directly extracted from speech signal as conditioning inputs, generating realistic A key feature approach speech-driven framework does not require transcripts. Our experiments show superiority this over three state-of-the-art baselines terms objective subjective evaluations. When target known, propose to create emotionally dependent models by either adapting base with data (CSG-Emo-Adapted), or adding conditions input (CSG-Emo-Aware). Objective evaluations improvements for CSG-Emo-Adapted compared CSG model, trajectory sequences are closer original sequences. Subjective significantly better results when happiness.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech-Driven Facial Reenactment Using Conditional Generative Adversarial Networks

We present a novel approach to generating photo-realistic images of a face with accurate lip sync, given an audio input. By using a recurrent neural network, we achieved mouth landmarks based on audio features. We exploited the power of conditional generative adversarial networks to produce highly-realistic face conditioned on a set of landmarks. These two networks together are capable of produ...

متن کامل

Context-conditional Generative Adversarial Networks

We introduce a simple semi-supervised learning approach for images based on in-painting using an adversarial loss. Images with random patches removed are presented to a generator whose task is to fill in the hole, based on the surrounding pixels. The in-painted images are then presented to a discriminator network that judges if they are real (unaltered training images) or not. This task acts as...

متن کامل

Bidirectional Conditional Generative Adversarial Networks

Conditional Generative Adversarial Networks (cGANs) are generative models that can produce data samples (x) conditioned on both latent variables (z) and known auxiliary information (c). We propose the Bidirectional cGAN (BiCoGAN), which effectively disentangles z and c in the generation process and provides an encoder that learns inverse mappings from x to both z and c, trained jointly with the...

متن کامل

Conditional Generative Adversarial Nets

Generative Adversarial Nets [8] were recently introduced as a novel way to train generative models. In this work we introduce the conditional version of generative adversarial nets, which can be constructed by simply feeding the data, y, we wish to condition on to both the generator and discriminator. We show that this model can generate MNIST digits conditioned on class labels. We also illustr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Affective Computing

سال: 2021

ISSN: ['1949-3045', '2371-9850']

DOI: https://doi.org/10.1109/taffc.2019.2916031